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Summary 

For normal canonical models with X ~ Np{9,a'^Ip), 5^ ~ '^^Xfc) independent, we consider the problem of 
estimating 6 under scale invariant squared error loss ^^^"f , when it is known that the signal-to- noise ratio 

^ is bounded above by m. Risk analysis is achieved by making use of a conditional risk decomposition 
^— I and we obtain in particular sufficient conditions for an estimator to dominate either the unbiased estimator 
5ub{X) = X, or the maximum likelihood estimator 6j^Iq{X, S'^), or both of these benchmark procedures. 
The given developments bring into play the pivotal role of the boundary Bayes estimator 5bu associated 
with a prior on {0,a) such that 9\a is uniformly distributed on the (boundary) sphere of radius m and 
a non-informative ^ prior measure is placed marginally on a. With a series of technical results related 
\Q to 6bu] which relate to particular ratios of confluent hypergeometric functions; we show that, whenever 
CN m < y/p and p >2, 5bu dominates both 5ub and S^^q- The finding can be viewed as both a multivariate 
extension of p = 1 result due to Kubokawa (2005) and a unknown variance extension of a similar dominance 
f-H finding due to Marchand and Perron (2001). Various other dominance results are obtained, illustrations 



are provided and commented upon. In particular, for m < y |, a wide class of Bayes estimators, which 
include priors where 6\a \s uniformly distributed on the ball of radius m, are shown to dominate Sub- 
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^ 1. Introduction 
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> 



1.1. The model 



*^ We consider the normal canonical model 

^ X r-. Np{9,a^Ip), ^2 _ or2^2^ independent, (1.1) 

with p > 1 and A; > 1, which plays a central role in both statistical theory and practice. We wish 
to estimate 6 under scale invariant squared error loss 

L((^^,a),d) = Mz^, (1.2) 
and where (6*, a) is restricted to the parameter space 

e(m) = l{e,a) e3?^x3fJ+:M<m,0<(T<oo|, (1.3) 
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where m is some known positive constant. Viewing a multivariate version of a signal to noise 

ratio, the problem can be described as estimating a normal mean with upper bounded signal to 
noise ratio. Alternatively, as previously described and analyzed by Kubokawa (2005) for p = 1, 
the parametric constraint places an upper bound on the reciprocal of the coefficient of variation in 
absolute value. 

We will be concerned with Bayesian inference in such restricted parameter space problems, which 
does not, conceptually, present any difficulties as both the prior and the resulting posterior (it it 
exists) will be adapted and will adapt to the constraints. Assessing the frequentist performance of 
Bayesian estimators in such situations is, however, considerably more challenging. Such assessments 
may include, for instance, testing for minimaxity, an evaluation in comparison to a benchmark 
procedure such as minimum risk equivariant (MRE) estimator or a maximum likelihood estimator 
(mle), or a study of the frequentist performance of associated Bayesian credible sets. 



Model (1.1) and the constraint (1.3) arise in normal full rank linear models Y ~ Nn{Z(3,a !„) 
with orthogonal design matrix Z{n x p); n > p; unknown parameter vector /3(p x 1), and with the 
constraint ^ < m. Indeed, the sufficient statistic = {Z'Zy^Z'Y = Z'Y, W = \\Y - Z^f) has 
a distribution which matches (1.1) with X = /3, S"^ = W, k = n — p accompanied by constraint 



(1.3) with 9 = (3. More generally, where Z is not necessarily orthogonal, the correspondence also 
arises as above through the constraint ^^^^ < m by setting X = (Z'Z)~^^'^(3 and 9 = {Z'Z)~^^'^(3. 

As considered and motivated by Kariya, Giri and Perron (1988), as well as Perron and Giri (1990), 



model (1.1) arises for the curved model setting 



Yi, . . . ,Yn independent Np{^, a'^Ip) , with cr^ = (1.4) 

A 

The correspondence is achieved by considering the sufficient statistic Y = ^ Y17=i ^ = ^'^(X]r=i O^i' 



Y){Yi - Y)') and setting X = ^/nY , S"^ = W , 9 = ^n/i, and = (n - l)p in (1.1 ). Constraint 



(1.3) arises by assuming A G [0,m] in (1.4), while the findings of Kariya, Giri and Perron (1988), 
as well as Perron and Giri (1 990) , relate to known A and the best equivariant estimator 5\{X,S) 
for loss ^^"^"^^^ (equivalent to 1.2) under the group of transformations G = Sf?"*" x 0{p), 3ft"'" being 



the multiplicative group of positive real numbers and 0{p) being the group of p x p orthogonal 
matrices. We will extract some features of the invariance structure and of the estimator S\{X, S) 
in Subsection 2.2, but refer to the above mentioned papers for additional details. 



1.2. The estimators and previous findings 

Two benchmark procedures are the unbiased estimator 6ub{X, S"^) = X and the maximum likeli- 
hood estimator ^jxile^"^' '^^)' Although both of the above procedures will be shown to be inadmis- 
sible (for p > 1 and p > 2 respectively), it is nevertheless of interest to describe the structure of 
improvements with an emphasis on potential Bayesian improvements. 

Actually, the choice 6ub{X, S"^) is clearly inefficient for our problem, as seen for instance by a 
straightforward analysis of the risk of affine linear estimators of the form Sa{X, S"^) = aX; a G 3ft. 
Indeed, the choices 5a with a G [^^^2^,1) dominate Sub{X,S'^) and the minimax procedure in 
this class, given by the choice a = ^^p , always dominates (strictly) SusiX, S'^). We will show 
below that ^^^^^(X, 5"^), which takes into account the constraint in opposition to 6ub{X, S'^), also 
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dominates 6ub{X, S*^) for all {m,p), so that it is more challenging to obtain improvements on the 
former as opposed to the latter. 

Remark 1.1. (Minimaxity) The above paragraph also implies that 6ub{X, S'^) is never minimax 
for our problem, in contrast with the case where 9, rather than -, is constrained to a ball of radius 



m and the loss is the same as in (1.2). Indeed, as shown by Kubokawa (2005) for p = 1 and as 
a consequence of a general minimax result given by Marchand and Strawderman (2011) applicable 
for p > 1, Sub{^,S'^) is minimax (but still inadmissible) for the restricted parameter space with 
\\9\\ < m and < a < oo . 

We denote Sh and Bh, respectively, as the p dimensional sphere and ball of radiuses h centered at the 
origin. We consider priors on the restricted parameter space G(m) admitting the representations: 

(i) 0\a^ ~ Uniform(5^<,),a2 ~ (a^)^-!, or (ii) e\a^ ~ Uniform(5„<,), ~ {a^)-^-\ (1.5) 

with I < p + k so that the associated posterior distributions be well defined. These interesting 
priors are improper, but they are proper with respect to 6 for fixed a. The case / = is of 
particular significance since it corresponds to the right Haar invariant case for cx^, and since the 
prior in (ii) corresponds to the truncation onto the restricted parameter space 0(m) of the usual 
noninformative prior for {6, cr^) set on 3?^ x 3?+. The Bayes estimators associated with priors in (i) 
and (ii) may hence be described as boundary uniform or fully uniform, and we denote these Sbu,i 
and Su,i respectively. 

The above priors and Bayes estimators are analogous to the estimators Sbu and Su studied by 
Marchand and Perron (2001) for the known cr^ case in model (1.1), with loss (1.2) and the restriction 



( 1.3 ), where both 6bu and 6u were proven to dominate the maximum likelihood estimator Si^\q{X) = 
( A 1)X for sufficiently small m; and where namely Sbu dominates 5j^]g(X), and hence X, under 
the simple condition m < ^/p. Furthermore, a result by Hartigan (2004) applicable for much 
more general restricted convex parameter spaces, and reviewed along with analogous findings by 
Marchand and Strawderman (2004), implies that 6u dominates X for all {m,p). 

Returning to our unknown case, Kubokawa (2005) provided various improvements on 6ub{X, S"^) = 
X for the univariate case p = I, showing (a) dominance of 6u^i for all m > and / G [0, + 1], as 
well as (b) the dominance of Sbu,i for m < 1 and I G [0, /c + 1]. Hence, these findings may be viewed 
as unknown univariate extensions of some of the existing results in the literature, including some 
described in the previous paragraph. However, Kukokawa's results are limited to the univariate 
case and do not apply to either the maximum likelihood estimator ^YnlQiX, S"^), nor to affine linear 
estimators aX, a G [0, 1). 

In this paper, we extend Kubokawa's findings concerning Sbu,i to the multivariate case p > 1 
providing sufficient conditions for dominance of both 6ub{X, S"^) and 5j^jg(X, S*^). Namely, we 
show that: (i) the Bayes estimators Sbuj, < I < k + p, dominate 6ub{X,S^) whenever m < ^/p, 



and (ii) Sbu,o dominates 5j^jg(X, S*^) whenever m < ^/p and p > 2 (Corollary 3.2). This yields 
a striking parallel with Marchand and Perron's dominance result for the known o"^ case. We also 
show that Sj^Ye always improves on 6ub, infer other dominance results, and present illustrations 
and accompanying commentary. For very small parameter spaces (precisely m < y^), we obtain 
a universal dominance result showing that a vast class of Bayes estimators, which includes 6u,i and 
Sbu,i for / < 0, dominate 6ub- 

Our methods also depart from, and arguably simplify, the methods used by Kubokawa in the 
univariate case applicable to Sbui- Key features of Kubokawa's results are techniques previously 
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introduced by Kubokawa himself (Kubokawa, 2004, Integral Expression for Risk Difference (lERD), 
as well as Marchand and Strawderman (2005). With extensions of these techniques challenging to 
obtain and elusive in the multivariate case, with the absence of results applicable to other estimators 
such as the S"^), we rather exploit a conditional risk decomposition on a maximal invariant 

T = -LiL in a similar fashion as Marchand and Perron (2001) and Moors (1985). The deficiency of 
^mle( ' '^^) '^mle^"'^' '^^) revealed as one of providing estimates that are too far from the origin. 
Even when m > y/p, we will show that improvements necessarily occur by projecting towards the 
Bayes estimator 5^(7,0- The Bayes estimator 6BUfi coincides with the best equivariant estimator 6m, 
and analytical results for Sbu,o, which we will describe and make use of, were previously given by 
Kariya, Giri and Perron (1988), as well as Perron and Giri (1990). 

The remainder of this paper is organized as follows. Section 2 contains key features and properties of 
the invariance structure and the risk function of equivariant estimators, as well as various expressions 
and key properties relative to the Bayes estimators Sbu,i and the maximum likelihood estimator 
5j^jg(X, S*^). The dominance results are presented in Section 3 along with various illustrations and 
comments. Final remarks are given in Section 4. 



2. Preliminary results 



2.1. The invariance structure and risk function 



The challenge here is to obtain good improvements or alternatives under loss (1.2) that capitalize 



on the parametric information in (1.3), as measured by the risk function 
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R{{e, a),S) = -^Ee,a [\\S{X, S) - e\\'] , {6, a) e Q (m) 



a 



(2.6) 



We provide findings for (nonrandomized) equivariant estimators, with respect to the group structure 
described in Section 1.1, and as shown by Kariya, Giri and Perron (1988), as well as Perron and 
Giri (1990), to be of the form 

6,{X,S') = h{^^)X, (2.7) 

for some measurable function h{-). Equivariant estimators are thus coUinear to X and conveniently 
represented by the corresponding multiplier h which controls the degree of expansion or shrinkage 
with respect to X, depending on (X, S^) only through the maximal invariant statistics T 



The class of such estimators include 6ub {h 



affine linear estimators aX {h 



S2 



5. 



mle 

see Lemma 2.6), generalized Bayes estimators 5bu,i (see Lemma 2.1), and more generally Bayes 



estimators with a spherically symmetric structure which includes 5uj (see Lemma 3.8). Equivariant 



estimators have a risk function in (2.6) depending on the unknown parameters only through the 
maximal invariant A = ■'^ G [0,m]. By a slight abuse of notation, we will write this risk R{X,5h)- 
Here is a useful representation for the risk -R(A, 6h) of equivariant estimators achieved by conditioning 
on the maximal invariant statistic T and highlighting the key role of the best equivariant estimator 
6xiX,S') (for M = A). 



Lemma 2.1. Under model (1.1) and loss (1.2), we have 



R{X,6,) = X' + Ex [a{T) {{h{T) - h,{T))' - hl{T)}] 



(2.8) 
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with a(T) = Ee,^ {^\T), b{T) = Ee,. ( ^ | T) , an(i h^{T) = ^ . 

Proof. The result is immediate by writing the loss as = fi'^^t^^ _|_ ^2 _ 2h{t)^ and 

decomposing the risk as E[L((^,a2), 5;,)] = E^{E[L((e,a2),5l)]|T}. " " □ 



2.8) above, the risk of 6h is constant for the restriction ^ = A and the 



Remark 2.2. As seen by 

optimal procedure is given by the BEE Sx{X, S"^) = h\{T)X . 
Kariya, Giri and Perron (1988), as well as Perron and Giri (1990) gave an explicit expression for 



the BEE 5\{X.,S'^). It is reproduced below in Lemma 2.1 where we derive an expression for the 
Bayes estimators 5buj- The estimators 5jn{,X, S'^) and 5bu,i necessarily coincide by virtue of general 
relationships between best equivariant estimators and Bayes estimators with respect to Haar right 
invariant priors (e.g., Eaton, 1989). 

2.2. Bayes estimators 

We begin here with a general expression for Bayes estimators associated with priors of the form 

TT (0, (T^) = TT^ (0) a'-^ l<k + p, (2.9) 

where 7ro-(-) is for fixed cr a (proper) density with respect to a finite measure supported on, or a 
subset of, the ball B(ma). 



Lemma 2.2. For model (1.1) and loss (1.2), Bayes estimators with respect to priors as in (2.9) 
are equal to 

S( ^\ , V.rcr^-'-^-'e-^ml{x)da' 

dT,[x,s)=x-\ , (2.10) 

Jp°° jji^ (^x) da"^ 

where Vx denotes the gradient vector with respect to x, and {2na'^)^'P/'^ rn^ is the marginal density 
of X\a with 

ml (x) = I e-5^ll"-^llV^ {6) du^{9) . (2.11) 

Proof. We have 

6 ix s^) = ^t^/"''"'^'] (2 12) 



/o" e~^""-''"'e-^ TT, (9) a^-^-^-^ du^{9) da^ 



(2.13) 



X + 



10 

The result follows since 
,2, 



a'W^ml {x) = {O-x) e'^^^'^-'^^ tt, {6) du,{e) . □ (2.14) 
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Lemma 2.2 applies to the boundary uniform and fully uniform priors in (1.5), among others. We 
pursue here with useful expressions for the former, which will also serve as a benchmark for other 
Bayesian estimators (see Lemma 3.8) in Section 3. 

Theorem 2.1. We have 6bu,i {X, S^) = h{m, /, T) X with T = 



and 



h{m, I, t) 



m 



2F 



( k+p-l _|_ -1 P I 1 m^t \ 
y 2 "T -L, 2 2{l+t) J 



P 



p m?t 
' 2' 2{l+i) 



(2.15) 



F being the confluent hypergeometric function given by F{a,b,z) = j^ff' ^ ^ 3ft, with (c)j 



ni=o(c + j) for z = 1, 2, . . and (c)o = 1. 



Here is a familiar identity (e.g., Watson, 1983), related to the normalization constant of a Langevin 
distribution and useful in the proof of Theorem |2.1[ 



(2.16) 



Lemma 2.3. For y G W, and U uniformly distributed on the sphere of radius r, we have 



EJe 



y'U] = r(-) (ll^ll^) 

2 {\\y\\r) 



\(p/2-l) 



where 1^ (■) is the modified Bessel function of order v given by 



' z y+2k 



k>0 



Proof of Theorem 2.1 , It suffices to calculate m^^ {x) = (x) for a boundary uniform prior 



TT 



as in ( 1.5[ i). We have from (2.11) and (2.16), with U uniformly distributed on S 



^ (x) 



E„ 



e 2 e E„ 



e 



_\\^\r p m\\x\. 
e 2e^r(-)(— — ) Ip/2-i{m\\x\\/ a) 



(2.17) 



Now, using the identity {t^ ^ ly (t)) = t^ " Iv+i (t), calculations lead to 



V.mf ^ (x) = ^ (x) + r(^) e-^e-^ m^'^/^ ^'^"'"^T^i V ll^ll/^) 



(2.18) 



Substituting this and (2.17) into (2.10) with an interchanging of and J (justified) yields 



i-fc-p/2-4 



m- 



X (J' " " e 2a 



ijn \\x\\/ a) da"^ 



^11 e-^^/p/2_i {m\\x\\/a) da^ 



(2.19) 
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Finally, the result follows by substituting the series expression for the Bessel functions above, 
interchanging sums and integrals, integrating out with respect to o"^, and some simplification. |^ □ 

Ratios of confluent hypergeometric functions and their properties hence play an important role here, 
as witnessed by Theorem 2.1 s representation of h{m,l,t). Furthermore, key analytical properties 
of Sbu,i (Lemmas 2.5 and 2.7), of other Bayesian estimators (Lemma 3.8) and corresponding risk 
function comparisons (Section 3) will hinge on various properties of such ratios, as those given by 
the following intermediate result. 

Lemma 2.4. For all a > 0, b > 0, z > 0, and c E {0, 1}, 

(a) the function Ka^h^d-) = ^^°^~F(t+i~b'^)'^''^ '^^ strictly decreasing on [0, oo) with lim^^oo Ka^b,c{.z) = 
0; 

(b) the function H (■) = ^p'l'.'l^^^^ is strictly decreasing on (0, oo). 
Proof, (a) We have for 2; > 

Ka,b,c (z) = (ca + (1 - c)b) E, 



1 



(ca+ (1 -c)b) + I 



where / is a discrete random variable with density proportional to ^""^y^ fr l{o,i,...}(0- Since these 
densities form a family with strictly increasing monotone likelihood ratio in I with parameter z, 
the result follows since (ca + (1 — c)b > 0, and decreases in i, i E {0, 1, . . .}. The 

limiting value is established by exploiting the representation (e.g., Abramowitz and Stegun, 1964) 
F{a, (3, z) = exp(z)z"~^ (l + O (^) ) , where O (^) is a bounded function by Mn a neighborhood 
of infinity. 

(b) Similarly, write = 1 + ^ Ea[J] , where J is a discrete random variable with mass function 

{a)j z^exp{-z} 
l'.(j)oc^^ J, l(o,w(j), 

and observe a strictly increasing monotone likelihood ratio property in J with parameter a. This 
implies the desired monotonicity of H{-). □ 

By making use of these above properties, as well as further properties of confluent hypergeometric 
functions, we derive the following analytical results concerning Bayes estimators 6bu,i given in 
Theorem 2.1 More precisely, we now describe how h{X,l,t) varies with respect to {X,l,t). 

Lemma 2.5. (a) The function h{X,l, ■) is, for X > 0, I < k + p, strictly decreasing on [0, 00) with 

sup^>o ^(-^5 ^ ^) = ^(-^5 ^ 0) = — and inft>o h(X, I, t) — 



P F '^+1,1,^ 



(b) the function h{-, I, t) is, for t > 0, I < k + p, strictly increasing on [0, 00). ■ 



Equivalently, one can proceed with an intermediate identity of the form 
for positive /x, a, T. 
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(c) the function h{\, t) is, for X > 0, t > 0, strictly increasing on (—00, k+p); and consequently 
h{X, I, t) > h{\, 0, t) for I eiO,p + k). 



0, and implies 
use 



Proof. In part (a), the monotonicity follows from part (a) of Lemma 2.4 setting c 
sup^^Qh{X,l,t) = h{X,l,0) and mit>o h{X,l,t) = limt^oo h{X,l,t) yielding the results. In (b) 
the recurrence relation 

(3F{a, (3,w) to obtain 



(e.g., Abramowitz and Stegun, 1964) wF{a + l, /3 + l,w) = l3F{a + l, I3,w) ■ 



h{X,l,t) 



l + t 
t 



' p+k—l p ,\ 
2 ' 2 ' 



' p+k—l 



+ 12 
^ 2' 



t) 



The increasing property then follows from part (a) of Lemma 2.4 setting a 



p+k-l 



2' 



1. 

□ 



Finally, part (c) is a consequence of part (b) of Lemma 2.4 and expression (2.15). 

Remark 2.3. As shown above in Lemma 2.5, h\{t) increases in I so that the amount of shrinkage 
or expansion of the boundary uniform estimators is controlled by the choice of the I in the prior 
(1.5). Namely, in comparison to the benchmark 6bu,o, ^bu,i will either expand for k + p > I > 0, 
or shrink for I < with the case I — )• —00 leading to Sbu,i shrinking to the degenerate estimator 
6 = 0. These properties will lead directly to precise risk comparisons in Section 3. We limit I 
to be less than p + k in order that the posterior distribution exist for all (x,s^). Interestingly for 
/ G [p + k,p + k + 1), the expansion of (2.12) holds even though the associated distribution is not a 
probability distribution. 



2.3. The maximum likelihood estimator 



In the same context, we determine the maximum likelihood estimator 5j^ig and some of its analytical 
properties. 



Lemma 2.6. For model (1.1) and parameter space Q{m) in (1.3), the maximum likelihood estimator 
of 6 is given by 6^i^{X, S"^) = h^i^{T)X , with T = ^^^p- and 

W^)= l7T7:^l\/l + 4 "i:/^^r -ll All . (2.20) 



2{p + k) 



t 



Proof. According to (1.1), the loglikelihood is given by 

1 .„ 



\nL{e,a) oc (ll^^ 



/i||2 I 2\ P + k 2 

(^W + S ) 7. — llicr^ 



For fixed a, the likelihood with respect to 6 is maximized for 

X if ||x|| < mcr 

ma-n^ if llxll > ma . 

\\x\\ II II — 

Therefore, we have sup(g o.)g0(„) L {6, a) = sup^-^o L ( 6'o-, a j . Next, we can see that 



— In L ( ^o-, o" ) oc 



2^2 



X - 9n 



if llxll < mcr 



^ + 2±^lna2 + i fm2 + %i^-2mM) if||x||>ma, 



Now, the minimum of — lnL^^o-,o"j is attained on y^,ooj whenever > 
0-^-[q{x,s) = X whenever t 
— In L ( ^o-, cr ) is attained at 



m2 



Therefore, 



^ < On the other hand, when t > the minimum of 

— p+k ' p+K ' 



m \\x\ 



2{p + k) 



1 + 4 



(k + p) 



\x\ 



+ 1-1 



Finally, since 6'j^ig(a;, s) = mp||Cro for such values of (x, s), the result follows. □ 

Remark 2.4. Observe that \\5^1q{x, s'^)\\ < \\x\\ with strict inequality for ^ > so that the 
maximum likelihood shrinks the unbiased estimator towards the origin. Moreover, the amplitude 
of this shrinkage as measured by h^^^it) is seen to be increasing with respect to t (i.e., h^j^J^t) 

decreases in t), with limi^oo h^ie{t) = , 7 = 

Now, for various risk differences involving the estimators Sj^^^, Sub, and the Bayes estimators Sbu,i, 
the following comparisons will be pivotal. 

2 

Lemma 2.7. (a) We have h{m, 0, t) < h^i^{t) for t > whenever p > 2, k > 2, and whenever 
k = 1 and m < ^/p. 

(b) For p > 2 and for p = l,k = 1, the condition m < ^/p is necessary and sufficient so that 
h{m,0,t) < h^i^it) for all t > and, in such cases, equality is attained if and only if t = 
and m = ^/p. 

(c) Whenever m < ^/p and I < p + k, we have h{m, l,t) < 1 for allt > with equality if and only 
ift = and m = ^/p. 



Proof. Part (c) is a direct consequence of part (a) of Lemma 2.5 Since h-^^ei') constant on 

2 2 2 

[0, ^^], and h{m, 0, ■) is decreasing and bounded above by ^ on [0, by virtue of Lemma 



[U, ^^J, and h[m, (J, ■) is clecreasmg and bounded above by ^ on [U, by virtue ot Lemma 2.5 
part (a) implies part (b). For proving part (a), by making use of representations (2.20) and (2.15^ 
it suffices to verify that 



R{z) = 



F(^ + l,f + l,z) 
F{'^ + l,l,z) 

F(^ + l,| + l,z) 



< 



p 



2{p + k) 



1^ I 2(fe + P) 



z 

> - 
p 



for all p > ^, ft; ^ z,, 
decomposition 



> 2,k > 2,z > OT for k = 1, m < ^ and z = ^ t [ 2(^2+^+^) , ^ 



(2.21) 
With the 



^ '2' ) ^ (2 + lj, 



+ 1 



P ' J^- 



\ y j>o 
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one obtains the representation 



R(z) = 1 + -f3{z) 
P 



2z 1 
1 + —(1 + kEJ = 1), 



(2.22) 



with Ez representing expected value with respect to a discrete random variable J with probability 



mass function proportional to ^E+iy^ §7 l{o,i,...}- With a similar expansion, we also have 



z{/3{z) 



kEJ 



J 



(2.23) 



Using (2.22) and (2.23), (2.21) becomes equivalent to 



l+f/3(^)>!(Vl + 



2{k+p) 



+ 1) 



where 



A{z) = ^ {4/3(^)(/3(^) - l)z^ + 4p(/3(^) - l)z + p'} > z 



T{z) = rimim - 1)^ + pim - 1)} • 



(2.24) 
(2.25) 

(2.26) 



Now, by making use of (2.22) and (2.23), expand T(-) in terms of two independent copies Ji and 
J2 of 2 J as 



T{z) 



1 + kEz 



1 



■p + 2 + Ji' 



E., 



J, 



k + p+ J2 



+ Ez 



2p 



'p + 2 + Ji 



E., 



2p{k + p) + JiJ2 + {p + 2) J2 + ik + 2p) J2 



_{p + 2){k + p) + J1J2 + ip + 2)J2 + {k+p)Ji_ ■ 

In the above, we have expressed the product of expectations as the expectation of a product given 
that the Jj's are independent. Given (2.25), it will suffice to show that T{z) is lower bounded by 1 
for all 2; > 0. Since p > 2 by assumption, we obtain with an expansion where we denote p the joint 
probability function of (Ji, J2) and d{ji,j2) = {p + 2){k + p) + jija + (p + 2)j2 + {k + p)ji, 

'{p + 2){k + p) + J1J2 + (p + 2) J2 + {k + p)J2 



Tiz) > E, 



Xp + 2){k + p) + J1J2 + (p + 2) J2 + (fc + p)Ji_ 

J2 — Ji 



1 + {k + p)E, 

l + + + E 

i2<ii j2>ji 

l + (fc+p)(^(j2-J 



(p + 2){k + p) + J1J2 + {p + 2)J2 + {k + p)Ji 
(j2 - Ji)p(ji,j2) 



32<jl 



P{j 1,32) P(j2,J 



diji,j2) d{j2,ji) 



i + {k+p){k~2)j2pUi,j2] 



(J2-J1)' 



J2<J1 



d{j2,ji)d{ji,j2) 



> 1 



since Ji and J2 are independent and for k >2. This establishes the result for all {m,p, k) such that 
p > 2 and k > 2. Finally, for k = 1 and m < y^, it is clear that (2.24) is verified for z < /2. 
This completes the proof. 



We point out that we have shown inequahty (2.21) for all p > 2, fc > 2, z > 0. We could not find a similar 
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3. Dominance results 



Following Lemma 2.1 s representation of risks, and the analytical properties of h{m, I, ■) and ^nile(') 
worked out in Lemmas 2^ and 2/7, we obtain various risk comparisons and dominance results which 
will be consequences of the following. 



Theorem 3.2. Consider equivariant estimators Sh as in (2.7) for estimating 6 for model (1.1), loss 



(1.2) and the parameter space Q{m). 



(a) A sufficient condition for to dominate 5h^ (6h2 7^ ^hi) is hi{t) > h2{t) and 
h{m,0,t) for all t > 0; 



fe2(f)+fel(t) 



> 



(b) Let S = {t : hi(t) > h{m,0,t)}. Let S_ be a subset (not strict) of S such that z/(S) > 
0, where v is the Lebesgue measure on M, then 6hi is inadmissible, and dominated by 6h 
with h{t) = hi(t) A h{m,0,t), as well as by any Sh, h{t) = hi(t)ls^(t) + h*(t)ls(t) with 
2h{m,0,t) - hi (t) < h* (t) < (t) for all t E S. 

(c) If hi{t) > h{m,0,t) for all t > 0, hi{-) ^ h{m,0,-), then the boundary uniform Bayes 



estimator 6 BU,o given in (2.15) dominates 5h^ 



Proof. Parts (b) and (c) are consequences of part (a) by setting h2 = h (as given) and /i2( 



h{m,0,-) respectively. From (2.8), we obtain 



EA[a(T){(/ii(T)-/i2(T)) X (/ii(T) + /i2(T)-2/i(A,0,T)}] (3.27) 
[a(T) A;, (T)] (say) , (3.28) 



where a(-)(> 0) is given in Lemma 2.1 Therefore, the conditions on hi and h2, along with part 



(b) of Lemma 2.5 which implies h(X, 0, ■) < h{m, 0, ■) for A < m, force Aa(T) > with probability 



one for all A G [0,m]. Finally, dominance indeed occurs with Ax{t) > for all A G [0,m) and 
t e {u : hi{u) > h2{u)}. □ 

Corollary 3.1. (a) For {m,p,k) such that p >2,k>2 or k = l,m < ^/p, dominates the 

unbiased estimator X . 

(b) For m < 6bu,o dominates S^i^ whenever p >2 or {p, k) = (1, 1). 

(c) Whenever m < ^Jp and / G [0, /c + p), ^bu,i dominates X. 

(d) For all {m,p) and I G {0,k + p), 6bu,i is inadmissible and dominated by Sbu,o- 

(e) For all {m,p) with m > ^/p, such that p > 2, k > 2, the estimator S^j^^ is inadmissible and 
dominated by the truncation 6h, with h{t) = h^i^{t) A h{m,0,t). |^ 



Proof, (a) We apply part (b) of Theorem 



3.2 



with hi 



1, S 



Recall that /ij-Qig(t) < 1 for t > as pointed out in Remark 



2.4 



(g,oo), and h* = h^Ye- 
Under the given conditions 



:, oo), and h* 



or as sharp as inequality in the Uterature. Such an inequahty, and the technique used to estabhsh it, may weU 
be of independent interest as various bounds for ratios of special functions like the hypergeometric here have been 
previously the subject of study (e.g., Joshi and Bissu, 1996; Kokologiannaki, 2012.) 



In both (d) and (e), further dominating procedures can be derived using part (b) of Theorem 3.2 
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on {rrijPjk), part (a) of Lemma 2.7 tells us that S = {t : 1 > h{m,0,t)} contains S_ and that 



h{m,0,t) < /ijj^ig(^) < hi(t) for all t G S_, from which the result follows. 

(b) Under the assumptions on {m,p,k), part (b) of Lemma 2.7 tells us that ^nile'^') — ^("^j 0; 



and the result is hence an immediate consequence of part (c) of Theorem 3.2 



(c) Parts (c) of Lemma 2.5 and Lemma 2.7 imply that h{m,0,t) < h{m,l,t) < 1 for all t > in 
cases where m < ^/p and / G [0,p + k). The result then follows as an application of part (b) of 
Theorem [3^ with hi = 1, h*{-) = h{m,l,-), and S = S = (0,oo). 

(d) Given part (c) of Lemma 2.5, the result follows as an application of part (c) of Theorem 3.2 

(e) Follows from part (b) of Theorem 3.2 with hi{-) = h.^ip{-), h*{-) = h{m,0,-) and S_ = S = 



h-A:' 



oo), and by making use of part (a) of Lemma 



2.7 



□ 



Remark 3.5. The dominance results in part (c) of Theorem 3.1 were obtained by Kubokawa (2005) 
in the univariate case, and constitute an extension to the multivariate case p > 1 for improving on 
5uB- However, with part (a) demonstrating the superiority of 6^^^ on 5ub, the more important 
finding is the dominance of 6bu,o over S^i^ for m < ^/p and p > 2^ For cases where m > y/p, 
p,k > 2, is still inadmissible and explicit improvements are available from part (e). We believe 
and conjecture that these results extend to the univariate case, but we have been unable to establish 



the inequality in part (a) of Lemma 2.1, which is a critical element of the analysis. 



Example 3.1. Part (b) of Theorem 3.2 indicates clearly that equivariant estimators 6h with h 
taking values that are too large on a subset of M"^ are inefficient and can be improved upon by 
projecting towards the benchmark h{m,0,-). Otherwise said, the function h{m,0,-) provides an 
upper envelope for a complete class of estimators. Key applications are explicited in Corollary 
3.1, leading to improvements on the unbiased estimator, the maximum likelihood estimator, and 
the generalized Bayes estimators Sbu,i with < I < p + k. Moreover, the degree of expansion 
translates to even greater inefficiency as is the case for the unbiased estimator. As an example, 
taking {p,k,m) = (5,10,2), Figure 1 shows the multipliers and risk functions of 6ub, ^rnle ^"'^ 
^BUfl o-s a function of X = G [0,2]. Here m < ^fp, so that the ordering of the multipliers and 
risks is clearly dictated, namely by Lemma 2.1 and Corollary 3.1. We see that huB = I is much too 
large, with very poor risk performance. The estimator S^i^ fares better but the gains provided by 
the Bayes estimator 6 bu,o ore nevertheless important; in relative terms ranging from a maximum of 
over 50% at X = to a minimum of around 12% at the boundary A = 2. 



Example 3.2. Further risk function comparisons are presented in Figure 2 for other combinations 
of {p,k,m). In both cases, we do not have m < ^ and the dominance findings for the boundary 
Bayes estimator Sbu,o do not apply, and there is also no guarantee that it performs satisfactorily 
from a frequentist risk point of view, even in comparison to the unbiased estimator. In one of the 
cases (with p = 5 and m = 3), the numerical results indicate that 6bu,o performs quite well in 
comparison to S^i^ with dominance and relative gains between 5% and 20%. But notice how the 
gains are less impressive than in Figure 1 where m is smaller and the dimension p = 5 is the same. 
In the other case, where the ratio ofm relative to p is much larger, not only do the findings not apply. 



notwithstanding the dominance result applicable to the truncation given in part (e) of Corollary 3. 1 
but the risk performance of Sbu^ is arguably quite poor. More research is thus required on alternative 
Bayes estimators, especially when m > ^/p. The deficiency of 6bu,o lies in the fact that it expands 
too much. Here, as an example, we have m?/p = (3^)/3 = 3 so that Sbu,o{x,s^) ~ 3x when x is in 
a neighbourhood of 0, which leads to poor estimates when ||^||/cr is small. Other priors with I < 



'For completeness, we have also summarized the situation in the very special case p = k = 1. 
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(p,k,m)=(5,20,2) 



[p,k,m)^(5,2D,2] 




Figure 1: Multipliers and risks of 5ub, (^nile ^^"^ ^BUfi for (p, /c, m) = (5, 20, 2). 



will shrink 6bu,o towards the origin. One such class of choices, studied in the known variance case 
by Fourdrinier and Marchand (2010), are obtained by taking tTo- in (2.9) to be a uniform density 
on the sphere of radius ao , for all a > with a G [0,m). Of course, for such priors. Theorem 



2.15 provides an expression for the Bayes estimator by replacing m by a. Here, we tried one such 
choice with a = 2.5 and the numerical evaluation indicates quite satisfactory performance with a 
significant improvement on 6bu,o near the centre and on a large part of the parameter space, with a 
slightly worse performance near or on the boundary. 




Figure 2: Risks of various estimators : unbiased (Sub), maximum likelihood {S-^Iq), boundary 
uniform {Sbu,o) and Bayes with 9\a uniform on sphere of radius aa (5a,o); for {p,k,m) = (5,20,3) 
and (p, k, m, a) = (3, 20, 3, 2.5) 

We conclude with a general representation for Bayesian estimators and a universal dominance result 
applicable to very small parameter spaces (where the average squared signal to noise ratio is less 
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than or equal to 1/2) which focusses quite clearly on the inadequacy of the unbiased estimator and 
lies in continuity with its inefficiency among affine linear estimators described in the introduction 
and the above risk comparisons. 



Lemma 3.8. For priors as in (2.9) with spherically symmetric densities tt^ for all a, Bayes esti- 
mators are equivariant, of the form 6t^{x, s^) = h„{t) x, with (i) < h^^lt) < h{m, I, t) for all t > 0, 
and with (ii) h.,^{-) < h{m, 0, ■) for all I < 0. 



Proof. Part (ii) is immediate from part (i) and part (b) of Lemma 2.5 For (i), ffist observe 
that Bayes estimators in (2.12) are also posterior expectations £',r*(6'|x, s^) for prior measures 
7i*{6,a) = 7Tcr{0)a''^^ (as expressed in equation 2.13). Now, proceeding as in Marchand and Perron 



(2001, Theorem 4), with spherically symmetric choices, the prior densities vr^ in (2.9) admit the 
representation 9\a RU, where R is supported on [0,m], U is uniformly distributed on the sphere 
5*0-, and R and U are independent (conditional on a). Hence, by making use of the developments 
in Theorem [2l| we have S^{x,s^) = E^,{e\x,s^) = E[RE{U\R, x, s^)] = E[h{R,l,t)\x, 
h{r,l,t) is given in (2.15) and is the posterior distribution of ||^ 



X, where 
Since this posterior dis- 
tribution is supported on [0,m], the result follows since h{-,l,t) is nonnegative and increasing for 
all {l,t) by virtue of part (a) of Lemma 2.5 



□ 



Theorem 3.3. Whenever m < y^, all equivariant estimators 5/i(a;, s^) = h{t)x with < h{t) < 
h{m, 0, t) for allt > dominate the unbiased estimator X . These include all Bayes estimators with 



respect to a prior n as in (2.9) with spherically symmetric densities ti„ for all a and I < 0. 



Proof. Applying part (a) of Theorem 3.2 with hi = 1 and h2 = h with the given assumptions on 
h, m and p, we have 

h(t) + 1 1 

^ >^>y>him,0,t) 



for all t > 0, with the rightmost inequality a consequence of part (a) of Lemma 2.5 The proof 
is complete by observing that the inclusion of the Bayesian estimators among the dominating 
estimators is a consequence of Lemma 3.8 □ 



Remark 3.6. The same result holds if the variance a is known. This corresponds to the setting 
studied by Marchand and Perron (2001) and the result may be established along the same lines and 
using similar developments given in their paper. They actually establish a universal dominance 
result (also see Fourdrinier and Marchand, 2010) but it applies for the more challenging problem 
of dominating the maximum likelihood estimator. All in all, the result here and the other domi- 
nance results involving the unbiased estimator may not be that surprising given that even the trivial 
estimator 5 = dominates X whenever m < y^. But, we have still provided a method of proof 
applicable to a very large class of Bayesian estimators. And, our findings elsewhere relate as well 

^mle- 



4. Concluding Remarks 

For estimating a multivariate normal mean with an upper bounded signal to noise ratio we have 
provided dominance results which can viewed as both multivariate extensions of results obtained 
by Kubokawa (2005), as well unknown variance extensions of results obtained by Marchand and 
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Perron (2001). In opposition to similar extensions for Stein estimation, the presence of the unknown 
scale here leads to challenges in describing Bayes estimators and some of their analytical properties 
which ultimately relate to frequentist risk performance. 

We have focussed mostly on boundary Bayes estimators and the benchmark estimators that are 
improved upon are those obtained from the principles of unbiasedness and maximum likelihood. 
As illustrated theoretically and numerically, and analogously to the known a case, the relative 
merits of the boundary Bayes procedure seem to fairly well correlate with the ratio of the radius m 
relative to y/p (see as well Marchand and Perron, 2001; Fourdrinier and Marchand, 2010; Kortbi 
and Marchand, 2012). 

More research is required to assess the performance of other Bayes estimators and namely to propose 
more attractive choices when m is larger relative to Such alternatives include fully uniform 
Bayes estimators. In this regard, an interesting question is whether the estimator 5u^q improves for 
all (m, p) on X with an affirmative answer representing an unknown variance extension of Hartigan's 
(2004) result in the particular case of balls. 

Several other related problems or issues are also of interest. As, an example, our findings do not 
address directly the issues of minimaxity and admissibility of Bayesian estimators. But it seems 
plausible and we conjecture that 5bu,o is minimax for small enough m as suggested by its risk in 
Figure 1 with the maximum risk attained on the boundary and since 6bu,o is quite likely a candidate 
to be an extended Bayes procedure. 

Finally, we point out that the results obtained here are applicable to two-sample problems with addi- 
tional information as described by Marchand and Strawderman (2004) and Marchand, Jafari Jozani 
and Tripathi, 2012. These involve independently distributed observables Xi ~ Np{9i,a''^),X2 ~ 
Np{92,cr''^), S'"^ ~ <y''^xh the objective of estimating 9i with the additional information that 
< m'. To achieve this, one "rotates" Xi and X2 to the independent coordinates X = 
(Xi - X2)/2 and W = {Xi + X2)/2 and considers estimators of 9i of the form S^{X,W, S''^) = 
W + ip{X,S''^) showing that dominates 5^^ estimating 9i under loss ^^"^"2^^^ with the ad- 
ditional information that < m' if and only if 'ijji(X,S'^) dominates 'ijj2iX,S'^) for es- 



timating 9 = E{X) = under loss 2^^^ and constraint — < m, for model (1.1) with 



X ~ Npi9,a^ = ^), 52 = 1^ and m = ^. 
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